Corpus: msa_wikipedia_2018_10K

Other corpora

2.2.5 Most frequent word beginnings

The most frequent word beginnings as character N-grams for N=1...5 with Zipf's diagram


Zipf's diagram for word beginnings


Gnuplot diagram

Top Characters
word rank frequency n-gram
1 2005 m-
2 1852 p-
3 1452 S-
4 1384 d-
5 1321 k-
Top Character Bigrams
word rank frequency n-gram
1 1562 me-
2 1326 pe-
3 1086 di-
4 831 be-
5 748 ke-
Top Character Trigrams
word rank frequency n-gram
1 932 men-
2 730 ber-
3 566 pen-
4 358 per-
5 338 mem-
Top Character 4-Grams
word rank frequency n-gram
1 384 meng-
2 246 peng-
3 146 meny-
4 123 memb-
5 93 peny-
Top Character 5-Grams
word rank frequency n-gram
1 90 menge-
2 79 menga-
3 71 mengh-
4 69 menye-
5 68 penga-
223 msec needed at 2024-03-30 02:04